A Tensor Voting Approach for Multi-view 3D Scene Flow Estimation and Refinement
نویسندگان
چکیده
We introduce a framework to estimate and refine 3D scene flow which connects 3D structures of a scene across different frames. In contrast to previous approaches which compute 3D scene flow that connects depth maps from a stereo image sequence or from a depth camera, our approach takes advantage of full 3D reconstruction which computes the 3D scene flow that connects 3D point clouds from multi-view stereo system. Our approach uses a standard multi-view stereo and optical flow algorithm to compute the initial 3D scene flow. A unique two-stage refinement process regularizes the scene flow direction and magnitude sequentially. The scene flow direction is refined by utilizing 3D neighbor smoothness defined by tensor voting. The magnitude of the scene flow is refined by connecting the implicit surfaces across the consecutive 3D point clouds. Our estimated scene flow is temporally consistent. Our approach is efficient, model free, and it is effective in error corrections and outlier rejections. We tested our approach on both synthetic and realworld datasets. Our experimental results show that our approach outperforms previous algorithms quantitatively on synthetic dataset, and it improves the reconstructed 3D model from the refined 3D point cloud in real-world dataset.
منابع مشابه
Triangle Mesh-Based Surface Modeling Using Adaptive Smoothing and Implicit Texture Integration
This paper presents a framework of surface modeling from multi-view range data. The input to the algorithms are triangle meshes, each of which is from a single view range scan. The triangle meshes generated from raw data are first processed by the proposed area decreasing flow for surface denoising. Although the proposed flow is mathematically equivalent to the mean curvature flow, it can avoid...
متن کاملGo with the Flow: Hand Trajectories in 3D via Clustered Scene Flow
Tracking hands and estimating their trajectories is useful in a number of tasks, including sign language recognition and human computer interaction. Hands are extremely difficult objects to track, their deformability, frequent self occlusions and motion blur cause appearance variations too great for most standard object trackers to deal with robustly. In this paper, the 3D motion field of a sce...
متن کاملRegion Segmentation based on Gaussian Dirichlet Process Mixture Model and its Application to 3D Geometric Stricture Detection
In general, image-based 3D scenes can now be found in many popular vision systems, computer games and virtual reality tours. So, It is important to segment ROI (region of interest) from input scenes as a preprocessing step for geometric stricture detection in 3D scene. In this paper, we propose a method for segmenting ROI based on tensor voting and Dirichlet process mixture model. In particular...
متن کاملViewpoint-aware object detection and continuous pose estimation
We describe an approach to category-level detection and viewpoint estimation for rigid 3D objects from single 2D images. In contrast to many existing methods, we directly integrate 3D reasoning with an appearance-based voting architecture. Our method relies on a nonparametric representation of a joint distribution of shape and appearance of the object class. Our voting method employs a novel pa...
متن کاملMulti-view dense depth map estimation
A novel dense depth map estimation algorithm is proposed in order to meet the requirements of N-view plus N-depth representation, which is one of the standardization efforts for the upcoming 3D display technologies. Hence, extraction of multiple depth maps is achieved from multi-view video. Starting from the piecewise planarity assumption of the scene, estimation of 3D structure of the patches,...
متن کامل